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Spectral Detection on Sparse Hypergraphs 
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Abstract — We consider the problem of the assignment of 
nodes into communities from a set of hyperedges, where every 
hyperedge is a noisy observation of the community assignment 
of the adjacent nodes. We focus in particular on the sparse 
regime where the number of edges is of the same order as 
the number of vertices. We propose a spectral method based 
on a generalization of the non-backtracking Hashimoto matrix 
into hypergraphs. We analyze its performance on a planted 
generative model and compare it with other spectral methods 
and with Bayesian belief propagation (which was conjectured to 
be asymptotically optimal for this model). We conclude that the 
proposed spectral method detects communities whenever belief 
propagation does, while having the important advantages to 
be simpler, entirely nonparametrlc, and to be able to learn the 
rule according to which the hyperedges were generated without 
prior information. 

1. Introduction 

Detecting information about the vertex properties that is 
hidden (or encoded) in the structure of a graph is a central 
issue in many problems in physics, biology and computer 
science. In fact, many systems of interest are composed 
of a large number of variables about which we do not 
have any information but the relationships (or part of the 
relationships) between them. Starting from this knowledge 
we aim at inferring individual properties of the nodes. 

In this context, the main approaches are statistical in¬ 
ference, where the detection is based on the assumption 
of a generative model for the graph [1], [2], and spec¬ 
tral methods [3]. For some classes of generative models, 
statistical inference methods based on belief propagation 
were predicted to be optimal in detecting planted hidden 
configurations [4], [5], [6]. Spectral methods have the great 
advantage of being non-parametric, meaning that they do 
not require any knowledge of the generative model (if one 
exists) and are completely independent of it. Nonetheless, 
standard spectral methods such as the adjacency matrix or 
the Laplacian succeed down to the information theoretical 
limit when the graph is sufficiently dense or regular [7], [8], 
[9], [10], [11] while tend to fail when graphs are sparse due 
to their sensitivity to fluctuations in the vertex degree. 
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These problems have been well studied in the case of 
graphs with simple edges between couples of vertices. How¬ 
ever, many networks have a different structure, and the 
relationships between vertex-variables are not established in 
couples but in A:-uplets, with k > 2. An exemple is given, 
for instance, by the network of scientific collaborations, of 
skype conference calls, email exchanges or recommendation 
systems where we can associate a user with a specific 
content and a rating. Translating the hypergraph into pairwise 
interaction would inevitably lead to a loss of information, and 
therefore some effort has been made to generalize spectral 
methods to multi-body interactions [12], [13], [14]. 

Here we study an extension of the spectral clustering 
method proposed in [15] based on a generalization of the 
non-backtracking matrix [16], [15] to the case of hyper¬ 
graphs (or factor graphs), that we argue to be effective on 
sparse networks. To test the performance of the spectral 
method we study it on a generative stochastic block model of 
hypergraphs similar to the ones defined in [17], [12] which 
is relevant for different problems ranging from community 
detection to planted constraint satisfiability. We compare the 
results with statistical inference based on belief propagation, 
which we also derive, and show the intimate connections 
between the latter and the non-backtracking operator. We 
also illustrate that on the sparse networks we consider, other 
spectral methods based on standard operators fail in the 
detection where the method we propose succeeds. 

A particularly remarkable point about the method is that 
it works without any prior knowledge of the generative 
model or its parameters, it is hence a promising tool to 
learn the probabilistic rules that were used to create the 
hypergraph. We illustrate this on the example of planted 
constraint satisfaction problems where information about the 
nature of the constraints is not assumed but inferred. 

The paper is organized as follows: In Sec. II we give 
the form of the generalized non-backtracking matrix and 
summarize the algorithm. In Sec. Ill we present the gen¬ 
erative model. In Sec. IV we then discuss the performance 
of the algorithm on hypergraphs generated by the model. 
In Sec. V we derive the belief propagation algorithm and 
the detectability phase transition by linearization around the 
uniform fixed point. In Sec. VI-A and VTB we apply the 
spectral algorithm to two specific cases of the generative 
model, namely an assortative stochastic block model and 
the planted 2-in-4-sat, comparing its behavior and phase 
transitions with the one of belief-propagation. Finally in Sec. 
VII we give our conclusions. 




Fig. 1. A graphical representation of one non-zero element 

of the generalized non-backtracking matrix, where grey squares represent 

hyper-edges (or factors) and white circles are vertices. 


II. Spectral Detection Algorithm 


Consider a hypergraph G{V,E), with vertices V with |y| = 
N, and hyperedges E, IZsl =M. We denote by djj. the set of 
vertices included in the hyperedge jj,. Each vertex i has an 
associated variable a, that can take values in the set £/ = 
{O,--- ,q— 1}. These variables are hidden from us, but it 
is assumed that the hyperedges were chosen in a way that 
depends on the values of these variables. The goal is to infer 
the variables from the structure of the hypergraph. 

The spectral algorithm we propose to detect hidden labels 
of the vertices in a sparse hypergraph is based on the 
following hypergraph non-backtracking operator 






I 1 if ;■ e dn\i 
I 0 otherwise, 


( 1 ) 


where i,j ,N are vertex indices and = I,--- ,M 

are hyper-edges (or factors). In Fig. 1 we show a graphical 
representation of one non-null element of the matrix. Vertices 
are represented by circles and hyperedges by squares. Each 
hyper-edge can be seen as a group of edges going from the 
participating vertices to the factor /t. The matrix is therefore 
of size kM x kM, where M is the number of factors and 

lYA ^ 


is the average degree of a factor. This matrix has multiple 
advantages with respect to the use of adjacency or incidence 
matrix, in particular the fact that it is non-backtracking 
inhibits the eigenvalues linked to high degree vertices and the 
bulk of the spectrum is confined in a circle in the complex 
plane for large random networks. 

The algorithm is the following; 

• Given the hypergraph, construct the non-backtracking 
matrix according to (1). 

• Compute the largest norm eigenvalues (and associated 
eigenvectors) down to the first that has a non-zero 


imaginary part. In this way we obtain q eigenvalue- 
eigenvector parrs. Given that the first eigenvector is 
associated to the degree of the vertices, we retain the 
subsequent g' — 1 to identify a partition in q groups. 

. For each of the q eigenvectors v construct an N com¬ 
ponent vector u as 

> (3) 

gedi 

where di indicates the set of factors to which vertex i 
participates. 

• Run the preferred clustering algorithm, for example soft 
k-means, on the components of the q — I eigenvectors 
u to obtain a partition of the vertices in q groups. 

The algorithm we propose can be applied to any kind 
of hypergraphs with any distribution of vertex and factor 
degrees. 

The other spectral method we consider for comparison is 
based on the adjacency matrix A, whose elements aij are the 
number of hyper-edges containing both the vertices i and 
j. The adjacency matrix of a hypergraph can be written in 
terms of the incidence matrix H as A — E[E[^ — D, where the 
incidence matrix is an V x M matrix such that 


Hn,= 


i G djJ., 
otherwise. 


(4) 


In the following we specify a generative model on which 
we will analyze the performance of our non-backtracking 
algorithm comparing it with the ground truth and other 
algorithms. 


III. The Model 

Consider a set of vertices V with |y| = N, where each 
vertex variable a, can take values in the set .e/ = {0, • • • ,g' — 
1}. Each vertex is independently assigned a value a G 
with a probability «« such that La = 1 ■ 

Consider now a kernel function ^ -G IR[ 0 ,i] that 
is symmetric under any permutation of the arguments and 
represented by the symmetric k-tensor p (although gener¬ 
alization to non-symmetric kernels should not present any 
particular difficulty). Let be the set of all the possible 
hyper-edges of degree k between the N vertices, the kernel 
tensor gives the probability of existence, independently, for 
any hyper-edge in To every hyper-edge in is 

associated an indicator variable with /i = 1, • • • , , with 

probability 

Prob(e^ = l|fl^) =Pa^ , (5) 

where G is the set of labels planted on the vertices 
participating to the hyper-edge gL. The hypergraph is fully 
specified by the vector e. The expected number of edges in 
the hypergraph is given by 


M= E 


E 




M 


(6) 







We are interested in the sparse case, i.e. M = 0{N), therefore 
the elements of the tensor must scale as 

Pa = ^ , (7) 

where Ca = 0{l). In the following, with a slight abuse of 
notation, a will indicate a variable in or 

which one of the three will be clear in the context. 

In the large N limit at leading order the expected degree 
of a node with a label a is 

1 

^ Ca,bY\nb,. ( 8 ) 

bes^k-i /=i 

It will also be useful in the following to define the two- 
vertices average degree, namely 

j k-2 

~ (k—ow ^ n 

ses^i<-2 i=i 


Given the planted assignment {a,}, the conditional probabil¬ 
ity of generating a certain hypergraph G specified by the set 
of indicator variables {e^} is 


P{G\{ai}i=i_^) 


n 



( 10 ) 

As anticipated in the introduction, this generative model 
covers a wide range of problems of which we give some 
examples below. 


• Planted Constraint Satisfaction Problems. Constraint 
satisfaction problems play a crucial role in theoretical 
and applied computer science as well as in engineering 
and physics due to their very general nature. In a CSP 
we consider a set of N discrete variables, typically 
Boolean, subject to a set of M constraints. In many of 
the usually considered cases like k-SAT or k-XORSAT 
or A:-in-2A:-SAT these constraints are all of the same 
type. In k-SAT, for example, the OR between M k- 
uples of variables (or their negations) must result to 
TRUE. In a random CSP the constraints are thrown at 
random between groups of variables, giving raise to a 
random graph. The question is if we can find a satisfying 
assignment. In a planted CSP [18], [19], [20], [21] we 
throw at random an assignment of the variables and 
then a series of constraints (factors or hyper-edges) that 
are satisfied by the assignment itself. The question is if 
and how, given the graph, we can recover the planted 
assignment. Planted CSPs are covered by the above 
generative model. 

• Hypergraph Stochastic Block Models. The stochastic 
block model is a popular way of generating graphs with 
a community structure. An ensemble of N vertices is 
labeled with values from 1 to ^ depending on which 
of the q communities they belong to. Given this as¬ 
signment, edges between couples of nodes are thrown 
at random with a probability (kernel) that depends 
on the labeling of the two nodes that participate to 


the edge. Two typical choices are the assortative case 
where nodes belonging to the same community are more 
likely to be connected or disassortative if the case is 
the opposite. Again, the objective is, given the graph, 
recover the underlying community structure. Our model 
is the natural generalization of the stochastic block 
model to the case of hypergraphs, which is relevant 
in many applications, from recommendation systems to 
co-authoring networks. 

• Coding. If we allow, in the planted CSP scenario, the 
constraints to be soft, meaning that they can be violated 
with a certain finite probability, the hypergraph can 
be seen as a noisy observation of the data, where the 
data consist in the planted assignment. In many cases 
encoding is performed by summing a random set of 
variables and transmitting the sum through a noisy 
channel. The choice of the sets results in a random 
hyper-graph with a structure that is determined by the 
code construction and the transmission gives a noisy 
version of this hypergraph. Our generative model can 
also be seen as a representation of this kind of setting. 
For the analysis of this paper we will consider the case 
where the average degree of the vertices is independent of 
the labeling, namely Ca = c. This is the statistically hardest 
case, because simply observing the degree of a node does 
not give any information about its planted variable. In doing 
so we obtain a random sparse hypergraph with structure 
but in which degrees of all the vertices come from Poisson 
distribution with average c. 

The performance of a detection algorithm can be evaluated 
through a measure of the normalized overlap between the 
planted assignment and the inferred one, namely 

]V ^aj,K{dj) ItiaXa 


Q = max - 


( 11 ) 


1 - max„ Ha 

where a is the planted assignment, a is the inferred assign¬ 
ment and n is any permutation of the labels. 


IV. Properties of B on the planted model 

In this section we present the properties of the non¬ 
backtracking matrix for a hypergraph generated with the 
above planted model. 

First we remark that the hypergraph can be seen as a bi¬ 
partite graph between the variable nodes and the hyperedges. 
In the case this bipartite graph is random (non-planted, but 
fixed degree sequence), results derived for the spectrum of 
the non-backtracking operator of this bipartite graph in [15], 
[22] translate directly to the present case. 

The largest eigenvalue of B is, asymptotically in the large 
N limit, equal to the average branching factor of the locally 
tree-like hypergraph. For a k-regular hypergraph this reads 

Bi = {k-l)£Q[d], (12) 

where Qd = {d + l)pd+i/c is the excess degree distribution 
and pd is the degree distribution. Since we are considering 
Poissonian degree, for the first eigenvalue we find 

Pi =c(k- 1). 


(13) 



The bulk of the spectrum is confined in the circle of radius 

p^^c{k-l). (14) 

This can be seen by considering that for any matrix B with 
eigenvalue p 

kM 

<Tr{B^WY. (15) 

1=1 

Moreover, for any fixed r and in the limit N ^ o°, the hy¬ 
pergraph is locally tree-like, therefore the diagonal elements 
(/ —> jJL){i —> jx) of (Z?'‘)(Z?'')^ count the exact number of 
factor nodes at r steps from p. through paths not including 
/, which in expectation is {k— Therefore for the trace 
we obtain 


E[Tr(B'')(B'')^] =kM(k-l)V, (16) 

which gives 

£[\pY]<{k-\yc'-. (17) 

Since this relation is true for any fixed value of r, we con¬ 
clude that almost all the eigenvalues of the non-backtracking 
matrix lie in the circle of radius (14). More refined analysis 
of [22] leads to the result that for a random hypergraph all 
but one eigenvalue lie in that circle. 

The planted model can be seen as a perturbed rank-r 
matrix. If the rescaled eigenvalues of the non-perturbed rank 
r matrix fall outside of the circle confining the bulk then 
they are also visible on the real axes in the spectrum of the 
non-backtracking operator. Hence, in analogy with [15], [22] 
the second q—l eigenvalues, when exceeding the bulk, are 
associated to eigenvectors that are correlated with the planted 
configuration and take values 


P 2 =c{k-\)X, (18) 


where X is one of the largest q—\ eigenvalues of the matrix 


T^ab — kla 


^ab 

c{k— 1) 


(19) 


which, as we will see, are degenerate under appropriate 
conditions. 

Inference of the planted assignment can be performed 
through a standard clustering algorithm like k-means applied 
to these eigenvectors when the informative eigenvalues ex¬ 
ceed the bulk, namely 


\P2\ > Vc{k-1). ( 20 ) 


The vectors that we cluster are constructed from the eigen¬ 
vectors of B by summing all the outgoing edges for each 
vertex. 

The algorithm hence requires to find the leading eigen¬ 
values of a kM X kM matrix, which can be very large if 
the average degree of the nodes is high. Nonetheless, if the 
hypergraph is k-regular as we are considering, we can reduce 
the size of the problem. The eigenvalue equation we are 
solving is the following 

y^, y^, ’^k^v =• (21) 

vedi\likedv\i 


Let us consider the sums of incoming and outgoing mes¬ 
sages, respectively 

v;"= E E 

vedikedv\i 

vr= E^<-v, 

vedi 

for which the eigenvalue equation (21) translates into 

idi-i)xY=xxr, 

E E v7-(k-2)vr-(k-i)vr=^v“. 

fJ.€di j€d^\i 

The preceding equation can be written in a compact form as 


( 22 ) 


(23) 


with 


and 


B'v = X\, 


V = 



/ 0 D-1 \ 

\-{k-l)t A-{k-2)l)^ 


(24) 

(25) 

(26) 


where is a 2N x 2N matrix, D is the diagonal degree 
matrix A is the symmetric adjacency matrix. Given this, we 
see that all the eigenvalues of the complete non-backtracking 
operator B are also eigenvalues of the reduced one B' except 
for those kM — 2N associated to the subspace defined by 

E E ^k^v=0 Vi, 

vedikedv\i 

El^v= 0 Vi, 

vedi 


for which we cannot assure the correspondence. Moreover, 
by a further transformation of the eigenvalue eq. (21), we 
can obtain the following non-linear equation describing the 
eigenvalues of the reduced operator 


1 

^ A(p—2+^) 



A 

P-1 


E E 

ll€dik€:dil\i 


(28) 


telling us that the differences between the B and B' spectra 
will be located in A = 1 and X = —{p — 1) due to the 
singularities that appear in the formula. 


V. Belief Propagation and Phase Transitions 

In this section we derive the asymptotic properties of 
the generative model through belief propagation and, as 
a byproduct, we highlight the connection between belief 
propagation and the generalized non-backtracking operator. 

Given the conditional probability of a graph (10), accord¬ 
ing to the Bayes rule the posterior probability of the planted 
assignment a, given the hypergraph G reads 

1 ^ 




80 



Fig. 2. The spectrum of the hypergraph non-backtracking matrix (1) for an hypergraph of size N — 1800 generated through the assortative HSBM with 
k — 3, q — 3 and c — A for (a) e = 0.22 and (b) e = 0.14. In (b) the informative eigenvalues lie outside the bulk and are degenerate, (c) The spectrum of 
the adjacency matrix for £ = 0.14. Despite being in the detectable phase the informative eigenvalue of the adjacency matrix is hidden in the bulk. 


where Z(G) is the proper normalization constant. This poste¬ 
rior probability is associated to a graphical model for which 
we can write 2^|<I’W| coupled message-passing equations 
[23] 


wr ‘= 


= 

Aa 


1 




L[ n xi 


j^lJ- 


b \jed^\i 


r.<^b ( 

-a,b 


1 - 


^a,b 


l-Cu 




n vg" 




(30) 


where di indicates all the hyperedges in that contain 
vertex i, and are normalization constants. The 

estimated marginal probability that a vertex was planted in 
the group a is given by the associated belief, namely 

n ( 31 ) 

veSi 


which reduces the number of messages to kM. Still, the 
computation of the effective field ha requires the summation 
of order terms which for large hypergraphs becomes 
problematic as soon as k>2. We will show in Sec. VI that 
the computation of the field can be largely simplified when 
a specific choice of the connectivity tensor c is made. 

It can be easily verified that, if the condition Ca = cVa 
holds, then the so-called factorized solution x'a^^ = 
a fixed point of eq. (33). In order for the inference of 
the planted assignment to be easy, the above mentioned 
factorized fixed point must be unstable, guaranteeing that 
belief propagation does not remain trapped in it. 

To analyze the linear stability of the factorized fixed point 
under random perturbation of the messages. Let us consider 
messages of the form 

zr^ = «« + er^ (35) 

Plugging (35) into (33) and developing to first order we 
obtain 


By plugging the first equation of (30) into the second, we 
obtain a closed set of k\S^^^ \ equations for the x messages: 


£‘a^^^LTa, ( 36 ) 

b jedv\i 


Aci 


1 




n L 

vedi\ji b 


n xi 

Jedv\i 


]^v 


(32) 


ev ( 

Ca,b 


Nb-^) \ 


In order to reduce the number of equations we note that there 
are two different types of messages i ^ fi, namely the ones 
living on the kM edges that actually exist in G, for which 
i G dji and — I, and the “ghost” edges for which i ^ dji 
and = 0. 

It can be shown that, up to corrections that vanish when 
N ^ for the “ghost” hyper-edges we have x'a^^ — Xa^ 
while for the messages living on the real edges 


= 

Aa 


—^— n, 


n E 

vedi\fi b 





with 




1 


E E 



(33) 


(34) 


where Tab is the matrix from (19) and B is the hypergraph 
non-backtracking matrix. 

The hypergraphs generated by the model in the regime we 
are considering are sparse and locally tree-like, meaning that 
on average loops start to be observed at a distance G(logAt). 
Let us then consider a tree of depth d and observe how a 
perturbation on a leaf propagates through the (unique) path 
connecting it to the root 


5x^ 


E 


d-\ 

i=0 



( 37 ) 


Now, taking independent random perturbations, summing up 
the contributions of all the leaves and considering that in 
the limit d ^ the matrix T'^ is dominated by its largest 
eigenvalue A, we have 


5/:^ Y (38) 

deleaves 


where Sjy is the perturbation along the direction of the 
dominating eigenvector. Therefore, in terms of expectation 





























we obtain 


(39) 


E[5/] =0, 


and 

e[(5/)^] :^c‘^{k-\fX^‘^£\{5x\\f'\ . (40) 

The instability threshold is finally given by 

c{k-\)X^ = l. (41) 

When c{k— 1)X^ < 1 the factorized fixed point is stable 
and belief propagation will hence not be able to infer the 
planted assignment, and for c{k— 1)A^ > 1 belief propagation 
succeeds. 

Moreover, if we restrict ourselves to the case na = I/q 
with Cab from (9) being 


^ab — 


if a = , 
if a ^ b. 


and consequently 

“''“[Tout ifa^fi, 

then we find the two following eigenvalues 


■^1 — 7in +l)7out — Oj 

■y _ rp rp _ Qn ^OUt 

^2 — -'in -'out — 


(42) 

(43) 

(44) 

(45) 


q{k— l)c ’ 

with multiplicities of 1 and q—l respectively. Therefore the 
instability has a closed expression, namely 

= (46) 

q 

with the expected degree given by 

^^ C.n + (g-l)Cout _ 
q{k-\) 

VI. Examples 

A. Hypergraph Stochastic Block Model 

Restricting ourselves to the case (42) and by analogy with 
the k = 2 case [5], [15], we define the hypergraph stochastic 
block model (HSBM) as assortative if Cin > Cout and disas- 
sortative otherwise. A sensible choice for the parametrization 
of the problem is by e = Cout/cin and the expected degree c. 
With this notation the transition is located at 

s/^{k^)-l 


^Jc{k-\) + {q-l) 

In this section we consider the specific case 

I Qn if Q\ — Cl2 = * * * = Cl}( , 

Cout Otherwise, 


^ay-aj, 


(48) 


(49) 



Fig. 3. Performance, in terms of the overlap Q (11), of the spectral 
detection on the assortative HSBM through the non-backtracking matrix 
(NBO) compared to spectral detection with the adjacency matrix (A) and 
Bayesian belief propagation (BP). The size of the graph is N — 300000 for 
the three curves and averages are taken over 5 samples. The vertical dashed 
line marks the detectability transition. 


time and simplifies in the following way up to (9(1/A) 
corrections 

We take as an example a HSBM with k = 3, q = 3 and 
c = 4 which gives the detectability transition located at 
Ec = 0.1688. In Fig. 2 we show the spectrum of the non¬ 
backtracking operator (a) in the undetectable phase (where 
the factorized belief propagation fixed point is stable) and 
(b) in the detectable phase (where belief propagation gives 
an informative fixed point). While in the undetectable phase 
we find only the leading eigenvalue associated to the average 
excess degree, in the detectable phase two more eigenvalues 
stick out of the bulk and the correspondent eigenvectors are 
correlated with the community structure. 

In Fig. 3 we show the performance of the spectral clus¬ 
tering through the non-backtracking operator combined with 
a standard k-means. Despite a slightly worse performance, 
it displays the same phase transition as belief-propagation, 
which we conjectured to be an optimal algorithm [5]. By 
contrast, spectral clustering through the adjacency matrix has 
a comparable performance deep in the detectable phase but 
it breaks down well before the phase transition due to the 
sparsity of the hypergraph. 


B. Planted 2-in-A-sat 


for which we can define the specific parameter 

e = Cout/cin = + • (50) 

As anticipated, with this setting the computation of the 
effective field in the belief propagation iteration takes linear 


In the planted 2-in-4-sat problem, after giving a random 
binary assignment to the vertices of the network, hyperedges 
are thrown at random between groups of k = 4 vertices with a 
certain non-zero probability only if in the k-uplet there is an 
equal number of zeros and ones. At fixed expected degree c. 


















-2 




(b) 


-3 

-4 




’ 


i 

J 



-4 -2 0 2 4 6 8 10 12 


(c) 



Fig. 4. (a,b) The spectrum of the hypergraph non-backtracking matrix (1) for a hypergraph of size N — 2000 generated through the planted 2-in-4-sat 

= 4, q — 2) for (a) c — 2.5 and (b) c — 4. In (b) the informative eigenvalue lie outside the bulk on the left because the model is disassortative. In (c) the 
spectrum of the adjacency matrix for c — 4. The informative eigenvalue is hidden in the bulk. 


this generative process fits into our general formulation with 
coon = 16c, 

Cin = 2c, (52) 

Cout = 4c. 

This problem has been studied through the cavity method in 
the so called “locked” case in [24]. Here we are interested 
in the problem of hnding a configuration correlated to the 
planted one. The scenario is quite different with respect to 
the preceding case. In fact here we find two phase transitions 
in the average degree, namely one from an impossible to 
hard detection [6], [24], and a second one from hard to easy 
detection that is our main focus. In the hard phase we conjec¬ 
ture that no known algorithm is able to retrieve the planted 
assignment due to the presence of the stable factorized hxed 
point, despite the fact that the global hxed point would be the 
one at high overlap. In this case the hard/easy phase transition 
in belief propagation is discontinuous and the transition is 
located at 

cc = 3, (53) 

which is obtained by plugging (52) into (46). In this case the 
computation of the effective held in the belief propagation 
equations is reduced to linear time with 

(M) 

Since this planted problem is essentially disassortative, as 
shown in Fig. 4 the informative eigenvalue sticks out at the 
left of the spectrum. 

In Fig. 5 we show the performance of the spectral al¬ 
gorithm based on the generalized non-backtracking matrix 
compared with spectral clustering on the adjacency matrix 
and with belief propagation. The adjacency matrix displays 
a terrible behavior up to very large average degree. The non¬ 
backtracking matrix instead undergoes a phase transition, 
located at the same value of the one we encounter in belief- 
propagation but the transition is continuous. Despite the fact 
that the non-backtracking operator performs considerably 
worse than belief propagation, the important point to under¬ 
line is that the spectral method is completely non-parametric 
and we do not even need to know what the kernel is. In fact. 
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Fig. 5. Performance, in terms of the overlap Q (11), of the spectral 
detection on the planted 2-in-4-sat through the non-backtracking matrix 
(NBO) compared to spectral detection with the adjacency matrix (A) and 
Bayesian belief propagation (BP). The size of the graph is N — 200000 for 
belief propagation and N — 400000 for the spectral algorithms. In both cases 
a single sample was taken. The vertical dashed line marks the hard/easy 
transition. The adjacency matrix starts giving a non-zero overlap only at 
very high average degree, namely ai'ound c ^ 9 which is out of the plot 
range. 


we can think of the spectral method also as a way to leant the 
kernel, in order to feed it as a starting condition for the hyper¬ 
parameters in belief propagation with parameter learning. Let 
us take as an example a graph generated with c = 3.4 and 
N = 40000, after running the spectral detection we have an 
inferred labeling of each vertex. Then let us take the list of 
M = 34000 factors and look at their composition (according 
to the inferred labels). The result is shown in Table I, telling 
us that we are likely observing a planted 2-in-4-sat model or 
something very close to it. 


TABLE I 

2-IN-4 KERNEL ESTIMATION 


0000 

1/34000 

0001 

5764/34000 

0011 

21758/34000 

out 

6477/34000 

nil 

0/34000 


























VII. Conclusion 

In this paper we have proposed a spectral algorithm to 
detect hidden planted configurations in very sparse hyper¬ 
graphs, based on a generalization of the non-backtracking 
Hashimoto matrix. To test the performance of the algorithm 
we have focused on a generative probabilistic model for 
hypergraphs in which the hyperedges depend on the incident 
variables via a fixed probability kernel. Given the generative 
model, we have also derived an asymptotically (conjectured) 
optimal belief propagation algorithm and presented a deriva¬ 
tion of the non-backtracking matrix as a linearization of be¬ 
lief propagation equations around the factorized fixed point. 
In addition we have used belief propagation to compute the 
location of the detectability phase-transition. 

The generative model that we consider includes a broad 
class of problems. Among them we have studied the as- 
sortative stochastic block model and the planted 2-in-4- 
sat. In the first case we obtained that the spectral non¬ 
backtracking clustering has a performance that is very close 
to the optimal belief propagation and displays a detectability 
phase transition at the same point. It also performed much 
better than the spectral clustering based on the adjacency 
matrix that breaks down way before the phase transition due 
to the sparsity of the graph. In the second case of the planted 
2-in-4-sat we observed a different phenomenon, reminiscent 
of a first order phase transition. While for belief propagation 
the phase transition is discontinuous from a hard inference 
phase to an easy inference phase with the overlap that jumps 
from zero to a value close to one, in the spectral detection 
algorithm we observe a continuous transition at the very 
same point. Spectral detection with the adjacency matrix, 
again, performs badly up to even higher degree. In both 
cases, the gain provided by the non-backtracking approach 
was clear. 

We also showed that despite the fact that the accuracy 
of the spectral method is significantly worse w.r.t. belief 
propagation (although they start to detect assignments at 
the same values of the parameters), the spectral approach 
has many important advantages: not only it is entirely non- 
parametric, but it is also a powerful instrument to learn the 
parameters, such as the kernel, when they are unknown. 
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